Characterizing the Performance Space of Shared Memory Computers Using Micro-Benchmarks

Authors

  • Rafael H. Saavedra
  • R. Stockton Gaines
  • Michael J. Carlton
Abstract

We have been investigating alternatives to conventional benchmarking that allow users and machine designers to characterize the performance space of the memory hierarchy and the interconnection network of different shared-memory machines in a uniform way, while providing a level of detail usually associated with hardware monitors. Our approach uses micro-benchmarks (small benchmarks, each measuring a particular aspect of the system under controlled conditions) and the concept of a Multiprocessor Memory Reference Pattern (M-MRP). An M-MRP is a family of memory reference sequences generated by a subset of processors. Each M-MRP is characterized by five attributes: 1) the number of processors generating addresses; 2) the region of space covered by each processor; 3) the distance between consecutive addresses; 4) the degree of contention on the data; and 5) the data dependencies between the processors. By varying each of these five dimensions, we can exercise the system and its individual components in a specific and predictable way. We then measure the performance response of the system to a particular micro-benchmark. Using this methodology, we have obtained detailed measurements of the performance characteristics of the memory hierarchies and interconnection networks of the KSR1 and Stanford DASH under many different conditions. Our results clearly show, for each machine, how network latency changes in response to different communication patterns and degrees of data contention.
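The five M-MRP attributes listed in the abstract can be pictured as the parameters of an address-stream generator. The following is only an illustrative sketch and not the authors' implementation: the class name `MMRP`, its field names, and the way contention (a group of processors sharing one region) and dependency (round-robin turn taking) are encoded are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class MMRP:
    """Hypothetical encoding of the five attributes (names are assumptions)."""
    processors: int    # 1) number of processors generating addresses
    region_bytes: int  # 2) size of the region covered by each processor
    stride_bytes: int  # 3) distance between consecutive addresses
    sharers: int       # 4) contention: processors per shared region (1 = none)
    dependent: bool    # 5) whether processors take turns (a simple dependency)


def address_streams(p: MMRP) -> List[List[int]]:
    """Return one list of byte addresses per processor."""
    streams = []
    for pid in range(p.processors):
        # Processors in the same group of `sharers` start at the same base,
        # so they touch the same data and contend for it.
        base = (pid // p.sharers) * p.region_bytes
        streams.append(list(range(base, base + p.region_bytes, p.stride_bytes)))
    return streams


def schedule(p: MMRP) -> List[Tuple[int, int]]:
    """Flatten the streams into an ordered list of (pid, address) references."""
    streams = address_streams(p)
    if not p.dependent:
        # Independent processors: each stream listed in full, no ordering implied.
        return [(pid, a) for pid, s in enumerate(streams) for a in s]
    # Dependent processors: one reference per processor per turn.
    return [(pid, s[i]) for i in range(len(streams[0])) for pid, s in enumerate(streams)]
```

Sweeping one field while holding the other four fixed corresponds to moving along one dimension of the performance space; an actual micro-benchmark would issue these references on real hardware and time them, which this sketch does not attempt.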


Similar resources

JIAJIA: A Software DSM System Based on a New Cache Coherence Protocol

This paper describes the design and evaluation of a software distributed shared memory (DSM) system called JIAJIA. JIAJIA is a home-based software DSM system in which the physical memories of multiple computers are combined to form a larger shared space. It implements a lock-based cache coherence protocol that completely eliminates the directory and maintains coherence through accessing write notices kept ...

Characterizing Application Performance Sensitivity to Resource Contention in Multicore Architectures

Contention for shared resources in the memory hierarchy can have a profound effect on the performance of applications running on high-end computers based on commodity multicore microprocessors. In this paper we describe a methodology, based on differential performance analysis, to quantify the effect of this contention on parallel applications. In particular, we characterize the contention for ...

Loop Parallelism on Tera MTA Using Sisal

The difficulty of programming parallel computers has impeded their widespread use. The problems are caused by existing hardware and software tools. The software problems on shared-memory and vector computers can be solved by using deterministic high-performance functional languages like SISAL. Distributed-memory computers have even more obstacles than shared-memory parallel machines. Research ...

Automatic mapping of parallel applications on multicore architectures using the Servet benchmark suite

Servet is a suite of benchmarks focused on detecting a set of parameters with high influence on the overall performance of multicore systems. These parameters can be used for autotuning codes to increase their performance on multicore clusters. Although Servet has been shown to accurately detect cache hierarchies, bandwidths and bottlenecks in memory accesses, as well as the communication over...

Performance Modeling and Measurement of Parallelized Code for Distributed Shared Memory Multiprocessors

This paper presents a model to evaluate the performance and overhead of parallelizing sequential code using compiler directives for multiprocessing on distributed shared memory (DSM) systems. With the increasing popularity of shared address space architectures, it is essential to understand their performance impact on programs that benefit from shared memory multiprocessing. We present a simple mod...

Publication date: 1993